Amendments To The Claims:

This listing of claims will replace all prior versions and listings of claims in the application: 
Claims 1-111. (cancelled) 

Claim 112. (currently amended) A method comprising:

(a) providing first data from a first set of samples wherein: 

(i) the first set of samples comprises a plurality of samples classified into a 
first biological state class and a plurality of samples classified into a second 
biological state class; 

(ii) the data from the first set of samples comprises a plurality of data 
elements, each data element characterized by a value, wherein all of the samples 
share a plurality of common data elements; 

(b) performing multivariate analysis on the first data to qualify each common data 
element in the first data based on the ability of the data element to classify a sample into 
the first biological state class or the second biological state class, wherein data element 
values are qualified using a classification model wherein classification is a function of data
element value;

(c) selecting a first subset of qualified common data elements from the first data; 

(d) providing second data from a second set of samples wherein: 

(i) the second set of samples comprises a plurality of samples classified into 
the first biological state class and a plurality of samples classified into the second 
biological state class; 

(ii) the data from the second set of samples comprises a plurality of data 
elements, each data element characterized by a value, wherein all of the samples 
share the plurality of common data elements; 

(iii) the first set of samples and second set of samples come from first and
second populations that have a statistically significant difference with respect to 
at least one preanalytical variable;

(e) performing multivariate analysis on the second data to qualify each common data
element in the second data based on the ability of the data element to classify a sample
into the first biological state class or the second biological state class, wherein data
element values are qualified using a classification model wherein classification is a
function of data element value;

(f) selecting a second subset of qualified common data elements from the second 
data; 

(g) selecting an intersection subset of data elements from the first and second subsets, 
wherein each data element in the intersection subset is a member of both of the first and 
second subsets; and 

(h) displaying the intersection subset on a graphical display interface on a user 
device. 
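For illustration only, steps (a) through (h) of claim 112 can be sketched in Python as follows. This is a minimal sketch and not the applicants' implementation: the function names, the sample data, and the top-k selection rule are all hypothetical, and a simple per-element class-separation score is swapped in for the claimed multivariate analysis with a classification model. Printing to the console stands in for the graphical display of step (h).

```python
from statistics import mean, pstdev

def qualify(samples, labels):
    """Score each common data element by how well its value separates two classes.

    Assumption: a univariate stand-in score, |difference of class means|
    divided by the summed class spreads, replaces the claimed multivariate analysis.
    """
    scores = {}
    for element in samples[0]:
        a = [s[element] for s, y in zip(samples, labels) if y == 0]
        b = [s[element] for s, y in zip(samples, labels) if y == 1]
        spread = (pstdev(a) + pstdev(b)) or 1.0  # avoid division by zero
        scores[element] = abs(mean(a) - mean(b)) / spread
    return scores

def select_subset(scores, k):
    """Keep the k highest-scoring data elements (hypothetical selection rule)."""
    return set(sorted(scores, key=scores.get, reverse=True)[:k])

# Hypothetical first and second data sets sharing common data elements m1..m3.
labels = [0, 0, 1, 1]  # 0 = first biological state class, 1 = second
first = [
    {"m1": 1.0, "m2": 5.0, "m3": 2.0},
    {"m1": 1.2, "m2": 5.1, "m3": 9.0},
    {"m1": 3.0, "m2": 5.0, "m3": 2.1},
    {"m1": 3.1, "m2": 5.2, "m3": 8.9},
]
second = [
    {"m1": 0.9, "m2": 4.0, "m3": 1.0},
    {"m1": 1.1, "m2": 6.0, "m3": 1.2},
    {"m1": 2.9, "m2": 4.1, "m3": 4.0},
    {"m1": 3.2, "m2": 5.9, "m3": 4.1},
]

first_subset = select_subset(qualify(first, labels), k=2)    # steps (b)-(c)
second_subset = select_subset(qualify(second, labels), k=2)  # steps (e)-(f)
result = first_subset & second_subset                        # step (g)
print(sorted(result))                                        # stand-in for step (h)
```

In this toy data only m1 separates the classes in both sets, so it is the only element that survives the intersection.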

Claim 113. (previously presented) The method of claim 112 wherein the first and 
second populations have a statistically significant difference with respect to a preanalytical 
variable selected from the group consisting of gender, age, ethnicity, sample collection 
parameter, sample processing parameter, weight, diet, medication status, medical condition, 
amount of physical exercise, pregnancy, level of circulating antibodies and a clinical 
characteristic. 

Claim 114. (previously presented) The method of claim 113 wherein the first and
second populations have a statistically significant difference with respect to a plurality of 
preanalytical variables selected from said group. 

Claim 115. (previously presented) The method of claim 112 wherein the first samples
and the second samples are collected from different geographical locations. 

Claim 116. (previously presented) The method of claim 112 wherein the first samples
and the second samples are collected from different clinical trial sites.

Claim 117. (previously presented) The method of claim 112 wherein the step of
selecting the first and second subsets comprises using the discovery data sets to train a learning 
algorithm wherein the learning algorithm ranks the data elements based on a quantitative 
measure of ability to classify. 
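As a hedged illustration of ranking data elements by "a quantitative measure of ability to classify," the sketch below trains a one-threshold classifier (a decision stump) on each data element and ranks elements by training accuracy. The stump, the function names, and the sample values are assumptions chosen for brevity; the claim does not specify this learning algorithm.

```python
def stump_accuracy(values, labels):
    """Best training accuracy of a single-threshold classifier on one data element."""
    n = len(values)
    best = 0.0
    for threshold in sorted(set(values)):
        predicted = [1 if v >= threshold else 0 for v in values]
        correct = sum(p == y for p, y in zip(predicted, labels))
        # Allow either polarity: "high value means class 1" or the reverse.
        best = max(best, correct / n, (n - correct) / n)
    return best

def rank_elements(samples, labels):
    """Rank data elements from most to least discriminating (ties broken by name)."""
    scored = [
        (stump_accuracy([s[e] for s in samples], labels), e)
        for e in samples[0]
    ]
    return [e for acc, e in sorted(scored, key=lambda pair: (-pair[0], pair[1]))]

# Hypothetical discovery data set with three common data elements.
labels = [0, 0, 1, 1]
samples = [
    {"p1": 1.0, "p2": 2.0, "p3": 7.0},
    {"p1": 1.5, "p2": 8.0, "p3": 3.0},
    {"p1": 4.0, "p2": 3.0, "p3": 6.0},
    {"p1": 4.5, "p2": 9.0, "p3": 2.0},
]
ranking = rank_elements(samples, labels)
```

Here p1 separates the two classes perfectly and ranks first; p2 and p3 each reach 75% accuracy and follow in name order.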

Claim 118. (previously presented) The method of claim 117 wherein the
learning algorithm is a supervised learning algorithm.

Claim 119. (Canceled) 

Claim 120. (previously presented) The method of claim 117 wherein the
training comprises using support vector machine analysis.

Claim 121. (Canceled) 

Claim 122. (Canceled) 

Claim 123. (previously presented) The method of claim 112 further comprising
independently re-sampling data elements in each data set.

Claim 124. (previously presented) The method of claim 112 further
comprising selecting candidate biomarkers from selected data elements and testing one or more
of the candidate biomarkers on a validation data set.

Claim 125. (Canceled) 

Claim 126. (Canceled) 

Claim 127. (previously presented) The method of claim 112 wherein the
biological state class is selected from the group consisting of: presence of a disease; absence of a
disease; progression of a disease; risk for a disease; stage of disease; likelihood of recurrence of
disease; a genotype; a phenotype; exposure to an agent or condition; a demographic
characteristic; resistance to an agent, sensitivity to an agent, and combinations thereof.



Claim 128. (Canceled) 
Claim 129. (Canceled) 
Claim 130. (Canceled) 



Claim 131. (previously presented) The method of claim 124 wherein the one or
more candidate biomarkers are diagnostic of the presence of a disease, risk of developing a
disease, risk of recurrence of a disease, or stage of the disease.

Claim 132. (previously presented) The method of claim 112 wherein values of
the data elements in a data point represent levels and/or frequency of components in a data point
sample.



Claim 133. (previously presented) The method of claim 132 wherein
components are selected from the group consisting of: nucleic acids, proteins, polypeptides,
peptides, carbohydrates and modified or processed forms thereof.

Claim 134. (previously presented) The method of claim 112 wherein levels of
components are measured by an expression profiling assay.



Claim 135. (Canceled) 
Claim 136. (Canceled) 

Claim 137. (previously presented) The method of claim 134 wherein the
expression profiling assay comprises measuring the amount and/or form of a protein,
polypeptide or peptide.

Claim 138. (previously presented) The method of claim 137 wherein the
expression profiling assay comprises mass spectrometry.

Claim 139. (previously presented) The method of claim 138 wherein the
expression profiling assay comprises SELDI analysis.

Claim 140. (Canceled) 

Claim 141. (previously presented) The method of claim 112 wherein
expression profiling comprises:

(a) contacting samples with a substrate comprising binding partners for
specifically binding to sample components having selected characteristics; and

(b) identifying sample components bound to the substrate.

Claim 142. (previously presented) The method of claim 141 wherein binding
partners are selected from the group consisting of cationic molecules; anionic molecules; metal
chelates; antibodies; single- or double-stranded nucleic acids; proteins, peptides, amino acids;
carbohydrates; lipopolysaccharides; sugar amino acid hybrids; molecules from phage display
libraries; biotin; avidin; streptavidin; and combinations thereof.

Claim 143. (previously presented) The method of claim 141 wherein the
binding partners are arrayed on the substrate.

Claim 144. (previously presented) The method of claim 117 wherein an assay
used to measure levels of data elements in training data sets from which candidate biomarkers
are identified is different from an assay used to measure data elements in a validation data set
used to validate the candidate biomarker.

Claim 145. (previously presented) The method of claim 140 wherein the assay
used to measure levels of data elements in training data sets is SELDI.

Claim 146. (Canceled)

Claim 147. (previously presented) The method of claim 112 wherein the
independent discovery data sets are collected from different locations, using different collection
protocols, and/or are collected from different populations.

Claim 148. (previously presented) The method of claim 112 wherein each
discovery data set is from a different clinical trial site.

Claim 149. (currently amended) A computer program product comprising a written,
electronic, magnetic or optical physical media that is computer readable and having: 

(a) receiving input data of at least first and second independent discovery data 
sets wherein: 

(i) the data sets comprise a plurality of forms of biological state classes; 

(ii) each data set comprises a plurality of data points, wherein each data 
point exhibits one form of a biological state class and each data set 
comprises a plurality of data points belonging to each of the classes; 
and 

(iii) each data point comprises a plurality of data elements, each data 
element characterized by a value, wherein all data points share a 
plurality of common data elements; 

(b) a second computer readable program code providing instructions for 
qualifying each common data element, independently for each data set, based on the 
ability of the data element to classify a data point into a biological state class, wherein 
the ability of the data element to classify a data point into a biological state class is a
function of data element value and for selecting an initial subset of data elements within 
each data set, and 

(c) a third computer readable program code providing instructions for 
selecting an intersection subset of data elements from the initial subsets, wherein each 
data element in the intersection subset is a member of a majority of the initial subsets. 
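Element (c) extends the pairwise intersection to any number of initial subsets by requiring membership in a majority of them. The following is a minimal sketch of that rule under stated assumptions: the function name and the example subsets are hypothetical, and "majority" is read as strictly more than half of the initial subsets.

```python
from collections import Counter

def majority_intersection(initial_subsets):
    """Return data elements that appear in more than half of the initial subsets."""
    counts = Counter(e for subset in initial_subsets for e in set(subset))
    half = len(initial_subsets) / 2
    return {element for element, n in counts.items() if n > half}

# Hypothetical initial subsets selected from three independent discovery data sets.
three_sets = [{"m1", "m2", "m4"}, {"m1", "m3", "m4"}, {"m1", "m2", "m5"}]
majority = majority_intersection(three_sets)

# With exactly two data sets, a majority means membership in both subsets.
pairwise = majority_intersection([{"m1", "m2"}, {"m1", "m3"}])
```

With three subsets, an element needs to appear in at least two; with two subsets the rule reduces to the ordinary set intersection.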



Z. Zhang et al. 
U.S.S.N. 10/635,241 
Page 8 

Claim 150. (previously presented) The computer program product of claim 149
wherein selecting the initial subsets comprises using the discovery data sets to train a learning
algorithm wherein the learning algorithm ranks the data elements based on a quantitative
measure of ability to classify.

Claim 151. (previously presented) The computer program product of claim 149
wherein the learning algorithm is a supervised learning algorithm.

Claim 152. (previously presented) The computer program product of claim 149
wherein the learning algorithm is an unsupervised learning algorithm.

Claim 153. (previously presented) The computer program product of claim 150
wherein training comprises support vector machine analysis.

Claim 154. (Canceled) 

Claim 155. (Canceled) 

Claim 156. (Canceled) 

Claim 157. (previously presented) The computer program product of claim 149
further comprising program code for independently re-sampling data elements in each data set.

Claim 158. (previously presented) The computer program product of claim 149
further comprising program code for selecting candidate biomarkers based on ranking by the
learning algorithm and for testing one or more of the candidate biomarkers on a validation data
set.

Claim 159. (Canceled)

Claim 160. (Canceled)

Claim 161. (previously presented) The computer program product of claim 149 
wherein the biological state class is selected from the group consisting of: presence of a disease; 
absence of a disease; progression of a disease; risk for a disease; stage of disease; likelihood of 
recurrence of disease; a genotype; a phenotype; exposure to an agent or condition; a 
demographic characteristic; resistance to an agent, sensitivity to an agent, and combinations
thereof. 

Claim 162. (Canceled) 
Claim 163. (Canceled) 
Claim 164. (Canceled) 

Claim 165. (previously presented) The computer program product of claim 158
wherein the one or more candidate biomarkers are diagnostic of the presence of a disease, risk of
developing a disease, risk of recurrence of a disease, or stage of the disease.

Claim 166. (previously presented) The computer program product of claim 161
wherein values of the data elements in a data point represent levels and/or frequency of
components in a data point sample.

Claim 167. (previously presented) The computer program product of claim 161
wherein components are selected from the group consisting of: nucleic acids, proteins,
polypeptides, peptides, carbohydrates and modified or processed forms thereof.

Claim 168. (previously presented) The computer program product of claim 160
wherein levels of components are measured by an expression profiling assay.



Claim 169. (Canceled)
Claim 170. (Canceled)

Claim 171. (previously presented) The computer program product of claim 168
wherein the expression profiling assay comprises measuring the amount and/or form of a
protein, polypeptide or peptide.

Claim 172. (previously presented) The computer program product of claim 168
wherein the expression profiling assay comprises mass spectrometry.

Claim 173. (previously presented) The computer program product of claim 168
wherein the expression profiling assay comprises SELDI analysis.

Claim 174. (Canceled) 

Claim 175. (previously presented) The computer program product of claim 168
wherein expression profiling comprises:

(a) contacting samples with a substrate comprising binding partners for 
specifically binding to sample components having selected characteristics; and 

(b) identifying sample components bound to the substrate. 

Claim 176. (previously presented) The computer program product of claim 175
wherein binding partners are selected from the group consisting of cationic molecules; anionic 
molecules; metal chelates; antibodies; single- or double-stranded nucleic acids; proteins, 
peptides, amino acids; carbohydrates; lipopolysaccharides; sugar amino acid hybrids; molecules 
from phage display libraries; biotin; avidin; streptavidin; and combinations thereof. 

Claim 177. (previously presented) The computer program product of claim 149
wherein an assay used to measure levels of data elements in training data sets from which
candidate biomarkers are identified is different from an assay used to measure data elements in a
validation data set used to validate the candidate biomarker.

Claim 178. (previously presented) The computer program product of claim 149
wherein the assay used to measure levels of data elements in training data sets is SELDI.

Claim 179. (Canceled) 

Claim 180. (previously presented) The computer program product of claim 149
wherein the independent discovery data sets are collected from different locations, using
different collection protocols, and/or are collected from different populations.

Claim 181. (previously presented) The computer program product of claim 149
wherein each discovery data set is from a different clinical trial site.

Claim 182. (currently amended) A system comprising: 
one or more processors for 

(a) receiving input data comprising at least first and second independent 
discovery data sets wherein: 

(i) the first set of samples comprises a plurality of samples classified 
into a first biological state class and a plurality of samples classified into a second 
biological state class; 

(ii) the data from the first sample set comprises a plurality of data 
elements, each data element characterized by a value, wherein all of the samples 
share a plurality of common data elements; 

(iii) the second set of samples comprises a plurality of samples 
classified into the first biological state class and a plurality of samples classified 
into the second biological state class; 

(iv) the data from the second sample set comprises a plurality of data 
elements, each data element characterized by a value, wherein all of the samples 
share the plurality of common data elements; 

(b) executing computer readable program code providing instructions for qualifying
each common data element, independently for each data set, based on the ability of the
data element to classify a data point into a biological state class, wherein data element
values are qualified using a classification model wherein classification is a function of data
element value and for selecting an initial subset of data elements within each data set;
and

(c) executing computer readable program code providing instructions for selecting an 
intersection subset of data elements from the initial subsets, wherein each data 
element in the intersection subset is a member of a majority of the initial subsets. 

Claim 183. (previously presented) The system of claim 182 further comprising
one or more devices for providing input data to the one or more processors.

Claim 184. (previously presented) The system of claim 182 wherein the one or
more devices for providing input data comprises a detector for detecting a characteristic of a data
element.

Claim 185. (previously presented) The system of claim 182 wherein the
detector comprises a mass spectrometer.

Claim 186. (previously presented) The system of claim 182 wherein the
detector comprises a gene chip reader.

Claim 187. (previously presented) The system of claim 182 further comprising
a memory for storing a data set of ranked data elements.

Claim 188. (previously presented) The system of claim 182 further comprising
a database of ranked data elements.

Claim 189. (previously presented) The system of claim 182 wherein selecting
the initial subsets comprises using the discovery data sets to train a learning algorithm wherein
the learning algorithm ranks the data elements based on a quantitative measure of ability to
classify.

Claim 190. (previously presented) The system of claim 189 wherein the
learning algorithm is a supervised learning algorithm.

Claim 191. (Canceled)

Claim 192. (previously presented) The system of claim 189 wherein training
comprises support vector machine analysis.

Claim 193. (Canceled) 

Claim 194. (Canceled) 

Claim 195. (Canceled) 

Claim 196. (previously presented) The system of claim 182 wherein the system
further executes program code for independently re-sampling data elements in each data set.

Claim 197. (previously presented) The system of claim 189 wherein the system
further executes program code for selecting candidate biomarkers based on ranking by the
learning algorithm and for testing one or more of the candidate biomarkers on a validation data
set.

Claim 198. (Canceled) 
Claim 199. (Canceled) 

Claim 200. (previously presented) The system of claim 182 wherein the
biological state class is selected from the group consisting of: presence of a disease; absence of a
disease; progression of a disease; risk for a disease; stage of disease; likelihood of recurrence of
disease; a genotype; a phenotype; exposure to an agent or condition; a demographic
characteristic; resistance to an agent, sensitivity to an agent, and combinations thereof.



Claim 201. (Canceled) 
Claim 202. (Canceled) 
Claim 203. (Canceled) 



Claim 204. (previously presented) The system of claim 197 wherein the one or
more candidate biomarkers are diagnostic of the presence of a disease, risk of developing a
disease, risk of recurrence of a disease, or stage of the disease.

Claim 205. (previously presented) The system of claim 182 wherein values of
the data elements in a data point represent levels and/or frequency of components in a data point
sample.

Claim 206. (previously presented) The system of claim 205 wherein
components are selected from the group consisting of: nucleic acids, proteins, polypeptides,
peptides, carbohydrates and modified or processed forms thereof.

Claim 207. (previously presented) The system of claim 205 wherein levels of 
components are measured by an expression profiling assay. 



Claim 208. (Canceled) 
Claim 209. (Canceled)

Claim 210. (previously presented) The system of claim 207 wherein the
expression profiling assay comprises measuring the amount and/or form of a protein,
polypeptide or peptide.

Claim 211. (previously presented) The system of claim 207 wherein the
expression profiling assay comprises mass spectrometry.

Claim 212. (previously presented) The system of claim 214 wherein the
expression profiling assay comprises SELDI analysis.

Claim 213. (Canceled) 

Claim 214. (previously presented) The system of claim 207 wherein
expression profiling comprises:

(a) contacting samples with a substrate comprising binding partners for
specifically binding to sample components having selected characteristics; and

(b) identifying sample components bound to the substrate.

Claim 215. (previously presented) The system of claim 214 wherein binding
partners are selected from the group consisting of cationic molecules; anionic molecules; metal
chelates; antibodies; single- or double-stranded nucleic acids; proteins, peptides, amino acids;
carbohydrates; lipopolysaccharides; sugar amino acid hybrids; molecules from phage display
libraries; biotin; avidin; streptavidin; and combinations thereof.

Claim 216. (previously presented) The system of claim 182 wherein an assay
used to measure levels of data elements in training data sets from which candidate biomarkers
are identified is different from an assay used to measure data elements in a validation data set
used to validate the candidate biomarker.

Claim 217. (previously presented) The system of claim 216 wherein the assay
used to measure levels of data elements in training data sets is SELDI.

Claim 218. (Canceled)

Claim 219. (previously presented) The system of claim 182 wherein the
independent discovery data sets are collected from different locations, using different collection
protocols, and/or are collected from different populations.

Claim 220. (previously presented) The system of claim 182 wherein each 
discovery data set is from a different clinical trial site. 

Claim 221. (previously presented) The method of claim 112 wherein the
multivariate analysis on the first data comprises use of a pattern recognition process.

Claim 222. (currently amended) The method of claim 221 wherein the pattern
recognition process comprises use of a classification model.

Claim 223. (previously presented) The method of claim 112 wherein the
multivariate analysis on the second data comprises use of a pattern recognition process.

Claim 224. (currently amended) The method of claim 223 wherein the pattern
recognition process comprises use of a classification model.